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Abstract. We show that a special case of the Feferman-Vaught composition theorem 
gives rise to a natural notion of automata for finite words over an infinite alphabet, with 
good closure and decidability properties, as well as several logical characterizations. We 
also consider a slight extension of the Feferman-Vaught formalism which allows to express 
more relations between component values (such as equality) , and prove related decidability 
results. From this result we get new classes of decidable logics for words over an infinite 
alphabet. 



Introduction 

The problem of finding suitable notions of automata for words over an infinite alphabet 
has been adressed in several papers [H [HI [3l O [23|- The motivations are e.g. mod- 
elization of temporized systems, distributed systems, or manipulation of semi-structured 
data. A common goal is to find a simple and expressive model which preserves as much 
as possible the good properties of the classical model. Kaminski and Francez [16] intro- 
duce finite-memory automata: these are finite automata equipped with a finite number of 
registers which allow to store symbols during the run, and compare them with the current 
symbol. The paper [5j extends somehow this idea by allowing transitions which involve 
an equivalence relation of finite index defined on the set of (vector) values of the registers. 
The paper [21] continues the study of finite-memory automata, and also introduce pebble 
automata, which are automata equipped with a finite set of pebbles whose use is restricted 
by a stack discipline. The automaton can test equality by comparing the pebbled symbols. 
The work [3] addresses decidability issues for some fragment of first-order logic which allows 
to express properties of words over an infinite alphabet, and introduces a related notion of 
automaton. More recently, Choffrut and Grigorieff [10] define automata whose transitions 
are expressed as first-order formulas (see below). Let us also mention the work [12] which 
studies variants of constraint LTL over infinite domains. 
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The aim of this paper is to show that a special case of the Feferman-Vaught composition 
theorem gives rise to a natural notion of automata for finite words over an infinite alphabet, 
with good closure and decidability properties, as well as several logical characterizations. 
Building on Mostowski's work [23], Feferman and Vaught consider in [TJ] several kinds 
of products of logical structures, and prove that the first-order (shortly: FO) theory of a 
(generalized) product of structures reduces to the FO theory of the factor structures and the 
monadic second-order (shortly: MSO) theory of the index structure. We refer the interested 
reader to the survey papers [2]j i32j which present several applications of these results, as 
well as extensions of the technique; for recent related results see e.g. [261 1271 [33], 

An interesting special case of the Feferman-Vaught (shortly: FV) theorem is when one 
considers the generalized weak power of a single structure 9Jt, and the index structure is 
(w; <). In this case the domain of the resulting structure roughly consists in the set of 
finite words over the domain of SOT (seen as an alphabet), and the definable relations can 
be characterized in terms of automata thanks to Biichi-Elgot-Trakhtenbrot results on the 
equivalence between definability in the MSO theory of (w; <) and automata. The automata 
model and related logics we consider can be seen as direct reformulations of this special 
case. Note that the connection between automata and products of structures was already 
explored in [2], where it is shown that automatic structures are closed under finite products. 

For the sake of readability, in the paper we first introduce the automata model and prove 
some of its properties, and then put in evidence the connection with the Feferman-Vaught 
construction. 

In Section [2] we define the automata model. Given a structure 9Jl with domain S 
(finite or not), we define OJt-automata as multitape synchronous finite automata which 
read finite words over E, and whose transitions are labelled by first-order formulas in the 
language of OJt. We show that the class of relations recognizable by such automata (which 
are called 9Jt-recognizable relations) are closed under boolean and rational operations, as 
well as projection, and that the emptiness problem is decidable whenever the FO theory 
of 9Jt is. These results are straightforward generalizations of the classical case of a finite 
alphabet. 

In Section [3] we provide two logical characterizations of 9Jt-recognizable languages. The 
first one uses MSO logic and is an easy adaptation of Biichi's classical result [7]. For the 
second one, we first introduce the notion of ^M- automatic structures which extends the 
notion of automatic structures, and prove some basic related results. Then we extend the 
Eilenberg-Elgot-Shepherdson FO formalism [13] for synchronous relations over words to the 
case of 9Jl-recognizable relations. This result, and actually the automaton model itself, are 
a natural generalization of Choffrut and Grigorieff results mentioned above [10] . 




Several results of Section [2] and [3] are rather easy generalizations or reformulations of 
well-known results; therefore many proofs in these sections are only sketched. 

In Section[3]we recall useful notions and results about products and powers of structures, 
then show the close relationship between 9Jt-recognizability and definability in generalized 
weak powers. This allows to revisit all previous results in the light of the Feferman-Vaught 
framework. 

Section [5] presents some applications. We first apply the previous ideas to improve 
a recent result by Kuske and Lohrey [19j related to the monadic chain logic of iteration 
structures; this application was brought to our attention by Wolfgang Thomas. In the 
second part of the section, we provide a logical characterization of 9Jt-recognizable relations 
for the special case where dJt = (a;; -|-), in terms of ordinal theories. 
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In terms of expressive power, 9Jt-automata are incomparable with automata and logics 
considered in [3l \W[ I24j . since on one hand they allow to express FO constraints, but on 
the other hand they cannot test whether two positions in a word carry the same symbol 
(for instance, the language {aa | a € S} is not 9Jt-recognizable whenever S is infinite, see 
Example I2.5p . As shown e.g. in [3l [23] these kinds of tests have to be limited if one wants 
to keep good decidability properties. In Section [6] we propose a slight extension of the 
Feferman-Vaught formalism which allows to test whether an n— tuple si, . . . , s„ of symbols 
appearing in distinct positions in a word w, satisfies a formula . . . , rE„) in 9Jt. We 

isolate a syntactic fragment of this logic, which we denote by M50^(£), for which the 
satisfiability problem (or in other words, the emptiness problem for related languages) still 
reduces to the decidability of the FO theory of 971. 

1. Definitions and notations 

In the sequel we deal with finite words over some alphabet, finite or not. Given an 
alphabet S (finite or not) we denote by S* (respectively Ti^) the set of finite words (respec- 
tively w— words) over S. The empty word is denoted by e, and the length of a finite word 
w by \w\. Given a word w € T,* with length n, we denote by w[i] the i— th symbol of w 
(starting from i = 0). We shall say that the position i carries w[i]. 

We consider several logical formalisms. By FO we mean first-order logic with equality. 
We shall also consider Monadic Second-Order Logic (shortly: MSO). We denote by FO{dJt) 
(respectively MS'0(9Jt)) the first-order (respectively monadic second-order) theory of the 
structure 971. We consider only relational structures. Given a language £ and a ^-structure 
9Jt, for every relational symbol -R of £ we denote by the interpretation of R in DJl. 
However, we will often confuse logical symbols with their interpretation. Moreover we will 
use freely abbreviations such as 3x £ X (p. 

We shall deal with multitape synchronous automata. As usual, given n finite words 
{wi, . . . , Wn) over S, we introduce a padding symbol and we complete (if necessary) each 
Wi with a sufficient number of #'s in order to have words of the same length. Doing this, 
we obtain n words over S U with the same length, which can be seen as a single word 
over the alphabet (S U {#})" (i.e. the alphabet of n— tuples of elements of S U {#}). This 
word will be denoted by {wi, . . . , u)„). 

Consider a relational language £ and a ^-structure 97T with domain S. Since we have to 
deal with the symbol # we shall associate to 971 the structure 97t# in the extended language 
£# = -C U {P#}, such that: 

• the domain of 97t^ is S U {#}; 

• for every relational symbol of £, we have R"^ = E™; 

• P#{x) holds in 97t# if and only if x = 

2. Definition and properties of 97t-AUTOMATA 

In this section we introduce the notion of 97t-automata and 97T-recognizable relations, 
and prove some basic results. 

Let S denote an alphabet, finite or not, and let 97T denote an £— structure with domain 
S. An 'tffC- automaton is a finite n— tape synchronous non-deterministic automaton which 
reads finite words over S. Transition rules are triplets of the form {q,Lp,q'), where q,q' are 
states of the automaton, and ip{xi, . . . ,Xn) is a first-order formula in the language £^ of 
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The transition {q,^,q') can be executed if the n— tuple of current symbols read by 
the n heads satisfies ip in 

Definition 2.1. Let S be an alphabet and let 5Jt denote an £— structure with domain E. 
An 9K-automaton is defined as a 7— tuple A = (Q, n, S, Tl, E, I, T) where 

• Q is a finite set (of states); 

• n > 1 is the number of tapes; 

• E Q X Tn X Q is the set of transitions, where Tn denotes the set of £^-formulas with 
n free variables; 

• / C Q is the set of initial states; 

• T C Q is the set of terminal states. 

Given an n— tuple w = {wi, . . . , Wn) of words over S, a path j in A labeled by {w) is a 
sequence of states 7 = (go, • • • , ^m), such that m = \ {w)\, qo G /, and for every i < m there 
exists a £#-formula ^{xi, . . . , Xn) such that {qi, ip, qi+i) € E and 



where Trj{{w)) denotes the j— th component of {w). The path 7 is successful if qm G T. We 
say that w is accepted by A if {w) is the label of some successful path. We denote by L{A) 
the set of words w G (S*)" which are accepted by A. 

Definition 2.2. Let n > 1. A relation X C (S*)" is said to be dyt-recognizable if and only 
if there exists an 9Jl-automaton A with n tapes such that X = L{A). 

Example 2.3. Let 971 = {uj; +) where + denotes the graph of addition. The following 
relations are 9Jt-recognizable: 

(1) the set of words over co (seen as as an infinite alphabet) of the form (1,0, ...,0) (we 
allow the case where there is no 0). Consider indeed the 9Jt-automaton with two 
states qo,qi, where qo is initial and qi is terminal, and whose set of transitions is 
{(90; V'l) 9i); (fZi; VO; 91)} where ipo{x) is the formula x + x = x, and (pi{x) expresses that 
X = 1. The automaton is pictured in Figured) 



(2) the set of words over iv whose symbols are alternatively even and odd. Consider indeed 
the 9K-automaton with two states qo,qi, where qo,qi both are initial and terminal 
states, and whose set of transitions is {{qo, ipe,qi), {qi, ^fo, qo)} where ipe{x) is the formula 
3z z + z = X, and ipo{x) = ^(pe{x). 

(3) the relation L C u* x uj* x uj* defined by {u, v,w) G L if and only if n, v, w have the 
same length, and moreover for every i the i— th symbol of w equals the sum of the 
corresponding symbols of u and v. Consider indeed the 9Jt-automaton with a single 
state q (which is initial and terminal) and a single transition {q,(p,q) where (p{x,y,z) 
is the formula x + y = z. Observe that if u, v and w do not have the same length then 
the last letter of {u,v,w), say {um,Vm,Wm), has at least one component which is equal 



m#^ip{7T,{{wm,...,7Tn{{wm) 




Figure 1: A simple 9Jl-automaton 
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to which by the very definition of imphes ^ ip{um,Vm,Wm), which imphes 
in turn that there is no (successful) run of A labelled by {ujVjw). 

Example 2.4. Let Tl = (S; {Pa)aeT;), where Pa{x) holds if and only if x = a. One can 
show that if S is finite then 9Jt-recognizable relations coincide with synchronous relations 
(as defined in [13]). 

Example 2.5. For every Xl-structure 971 = (S; ...) such that S is infinite, the language 
X = {aa I a € S} is not 9Jt-recognizable. Indeed assume for a contradiction that there exists 
some SUt-automaton A = {Q, n, S, 9Jt, E, I, T) which accepts X. Since X is infinite and E is 
finite, there exists an infinite subset of X whose elements admit a common successful path. 
More precisely, there exist two transitions {qo,^i,qi),{qi,f2,Q2) £ E such that qq G /, 
q2 € T, and infinitely many elements a G S satisfy 9Jt# |= ^pi{a) and |= (P2{cl). Thus 
there exist at least two distinct elements ai ^ 02 such that 9Jt# \= (pi{ai) and \= ip2{a2), 
which implies that aia2 is accepted by A, and this leads to a contradiction. 

The closure properties of synchronous relations still hold for SDT-recognizable relations. 

Proposition 2.6. The class of '^-recognizable relations is closed under 

(1) boolean operations; 

(2) cylindrification; 

(3) projection. 

Proof, (sketch) (1) the closure under union is a straightforward adaptation of the classical 
construction for non-deterministic automata. Let us outline the proof for the closure under 
complementation. Assume that the relation R C (S*)" is recognized by the 9Jt-automaton 
A. Let 931, ... , denote the formulas which appear in the transitions of A. Consider, for 
every subset J C {1, . . . , m}, the formula ipj : AieJ ^ f\i^j "^^^ SDT-automaton A' 
which is defined from A by replacing every transition [q, ipi, q') G E with all transitions of 
the form {q,ipj,q') where i (z J, also recognizes R. Moreover A' can be seen as a classical 
non-deterministic automaton over the finite "alphabet" of formulas ipj, and thus it can 
be determinized, i.e. transformed into an equivalent 9Jt-automaton A" whose transitions 
involve the formulas ijjj, and such that for every state q and every formula ipj there exists 
a single transition of A" the form {q,ipj,q'). Now one can use the usual construction for 
complementation of deterministic automata, i.e. turn non-terminal states to terminal states 
and conversely, and get an 9Jt-automaton which recognizes (S*)" \ R. 
(2) is straightforward. 

For (3), in order to recognize the projection of i? C (S*)", say over the n — 1 first 
components, it suffices to replace, in the 9Jt-automaton which recognizes R, all transi- 
tions {q,ip{xi, . . . ,Xn),q') with transitions {q,3xn^{xi, . . . ,Xn),q'). D 

Regarding the emptiness problem for 971-recognizable languages, the main difference 
with the classical case is that in an OJl-automaton A there can exist transitions {q, ip, q') € 
E such that no n— tuple of elements of 9Jl^ satisfies 99; such transitions will never be 
executed by A. Thus one has to remove such transitions from E in order to apply the usual 
reachability algorithm for the emptiness problem; this can be done effectively if and only if 
FO(9Jl#) is decidable. Since FO(9Jt^) and FOi^) are reducible to each other, we get the 
following result. 

Proposition 2.7. The decidability of the emptiness problem for ^-recognizable languages 
is equivalent to the decidability of FO{dyt). 
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3. Logic and 9Jt-AUTOMATA 

There exist three important logical formalisms which capture automata: 

• Biichi-Elgot-Trakhtenbrot MSO logic (see [7]), i.e. the weak monadic second order theory 
of {uj, <); 

• the Eilenberg-Elgot-Shepherdson (shortly: EES) formalism |13] . i.e. the FO theory of 
S = (T,*;EqLength,^,{La}aes) where 

- EqLength{x, y) holds if and only if x and y have the same length 

- X ^ y holds if and only if x is a prefix of y 

- La{x) holds if and only if a is the last letter of x. 

• the so-called Biichi Arithmetic of base k, i.e. the FO theory of the structure (u;;+, T4) 
where Vk{x) denotes the greatest power of k which divides x, see [6]. 

In this section we extend the two first formalisms to the case of words over any alphabet 
(finite or not). In order to extend the EES formalism, we introduce the notion of 
automatic structure, which generalizes the one of automatic structures. 

In Section [5.21 we will prove that for 9Jt = [uo] +), the class of OJt-recognizable relations 
corresponds to the class of relations definable in the structure (o;"^; +). The latter structure 
can therefore be seen as "Biichi Arithmetic of base cj" . 

3.1. Monadic Second-Order Logic. 

Biichi, Elgot and Trakhtenbrot prove that languages of words definable by MSO logic 
coincide with regular languages (see [33]). As an example if S = {a, 5} then one can 
characterize the set of words w G aa*b* in MSO logic with the formula 

3x{Qa{x) A V2/((x < y ^ Qb{y)) A {y < x ^ Qa{y)))) 

where the (first-order) variables x, y are interpreted as positions in the word, < denotes the 
natural ordering of positions, and Qs{y) holds if and only if the y— th position in the word 
carries the symbol s. The unary predicates Qa and Qi^ express properties related to elements 
of S. These properties are actually first-order definable in the structure 9Jt = (S;Pa,Pfe). 
We shall extend this formalism by considering any structure 9Jt with domain S (finite or 
not) and adding to the MSO formalism unary predicates apix) which express that the 
symbol at position x satisfies the formula F in 9Jt. 

More formally, let OJt = (S;...) be an £-structure. We associate to every £#-formula 
F some unary relational symbol ap- We define then MSO{2?j as MSO over the language 
{<, (ai7')i?(=jr} where T denotes the set of (first-order) £#-formulas with at least one free 
variable. 

Definition 3.1. We say that A C (S*)" is M SO {m)- definable if there exists an MSO{£)- 
sentence ip such that w = {wi, . . . ,Wn) € ^ if and only if 

{D,<D,iaF)Fe:F) \= V' 

where 

• D = {0, 1, . . . , \ {w)\ — 1}, and <_d is the natural ordering relation restricted to D; 

• For every £^-formula F with n free variables, and every position x in w, the formula 
apix) holds in (D, <£>, {aF)F&r) if and only if 

m#^F{T,i{{w))[xl...,Tln{{w))[x]), 

where 7ri{{w)) denotes the i— th component of (w). 
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Example 3.2. Let Tl = (w; +). 

• the set A <^ uj* of words that contain only even symbols is MS*© (9Jt)-definable by the 
sentence Vy apiy), where F{x) : 3z{z + z = x) 

• the set B C. oj* oi words whose symbols are alternatively even and odd is MSO{^)- 
definable by the formula 

VxVy((x < y A -i3z x < z < y) — > {[apix) <-> a-,i?(y)))) 

where F is the formula defined above. 

Example 3.3. Let Tl = (S; (Pa)aGs)- If S is finite, then MS'O (9}T)-definable languages 
coincide with languages definable in Biichi's MSO logic. Indeed in this case it is rather easy 
to prove that any formula F{xi, . . . ,Xn) in the language of 971 is equivalent to a boolean 
combination of formulas of the form Pa{xi), which implies in turn that every predicate 
apiv) is equivalent to a boolean combination of predicates Qa{y)- 

We now generalize the equivalence between recognizability and definability to the case 
of any alphabet S (finite or not). 

Proposition 3.4. Let ^JOl be a structure with domain T,. For every n > 1 and every relation 
R C (S*)", the relation R is M SO {^JOl) -definable if and only if it is ^-recognizable. 

Proof. The proof is a simple adaptation from Biichi's equivalence between WMSO defin- 
ability and recognizability [7] . For the direction from recognizability to definability one uses 
the Biichi's technique of encoding of an accepting run of an automaton by a formula. For 
the converse one uses the fact that the formulas a^i, • • • ,aFm appearing in an MS0[2)- 
sentence ■0 can be chosen such that every n— tuple of elements of the domain of 9Jt^ satisfies 
exactly one formula among the FiS in 9?l#, which allows then to see words over S U {7^} 
as words over the finite alphabet {1, . . . , m} and then prove the result by induction on the 
construction oi ip. □ 



3.2. 9Jt-automatic structures and an extension of the EES formalism. Automatic 
structures (see [l5l[I7l[2]) are relational structures which can be presented by finite automata 
over a finite alphabet. 

Definition 3.5. The structure = {N; Ri, . . . , Rk) is said to be automatic if there exist 
a finite alphabet S and an injective mapping fi : N ^ Y,* such that the images by ^ of 
N, Ri, . . . , Rf: are synchronous relations. 

The fundamental result about automatic structures is the following. 

Theorem 3.6 ([IS]). 7/9^ is automatic then: 

(1) the image by // of every relation definable in ^ is a synchronous relation; 

(2) FO{m) is decidable. 

We can generalize the previous notions and results to the case of an infinite alphabet. 

Definition 3.7. Let 9Jl = (S; . . . ) and = (A^; Ri, . . . , Rk) be two structures. We say that 
is dJl-automatic if there exists an injective mapping ^ : N ^T,* such that the images by 
fi oi N, Ri, . . . , Rf^ are 9Jl-recognizable relations. 
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Example 3.8. Let 97t = (a;;+), and let ^ = {to \ {0}; x) where x denotes the graph of 
multipUcation. The structure (which is often called Skolem arithmetic) is 9Jl-automatic. 
Consider indeed the function ^ : {uj \ {0}) — > oo* which maps every natural number n > 1 
whose prime decomposition is n = Pg^p"^ ...p^'', where pi denotes the i— th prime and 
rifc 7^ 0, to the word = riQni . . . n^. Moreover let /u(l) = e. It is not difficult to check 
that the image by ^ of w \ {0} and x are 9Jt-recognizable. 

One can prove rather easily that if S is finite and 9Jt = (S; (Pa)aeE), then 9Jl-automatic 
structures correspond to automatic structures. On the other hand, there exist structures 
which are 9Jl-automatic but not automatic. For instance in [2] it is proven that the structure 
^ considered in the previous example, i.e. Skolem arithmetic, is not automaticEI. 

We can extend Theorem 13.61 in the following way. 

Theorem 3.9. is ^-automatic then: 

(1) the image by /x of every relation definable in is Tl-recognizable; 

(2) FO{m) reduces to FO{dJt). 

Proof, (sketch) This is a straightforward adaptation of the proof of Theorem 13.61 For (1) 
one proceeds by induction on the formulas and use the closure properties of 9Jt-recognizable 
relations as stated in Proposition l2.6[ For (2), one first deduces from (1) that the decidability 
of -FO(DT) reduces to the decidability of the emptiness problem for 9Jt-automata, and then 
uses Proposition 12.71 □ 

Let us now turn to the first-order characterization of 9}T-recognizability. Eilenberg, 
Elgot and Shepherdson prove the following result. 

Theorem 3.10 ([H]). Let S denote a finite alphabet with at least two elements. For every 
n > 1, an n—ary relation over T, is synchronous if and only if it is definable in the structure 

S = {T.*;EqLength, ^, {Lajags) 

where 

• EqLength{x,y) holds if and only if x and y have the same length; 

• X <y holds if and only if x is a prefix of y; 

• La{x) holds if and only if a is the last letter of x. 

It is easy to check that for every finite alphabet S the structure iS is automatic, and 
moreover one can deduce from the above theorem that every automatic structure is indeed 
FO— interpretable in S (see e.g. [2]). 

In [13], the authors asked whether there is an appropriate notion of automata that 
captures this logic when S is infinite. Choffrut and Grigorieff [10] recently solved this 
problem (and also other questions raised in [13]) by introducing a notion of automata with 
constraints expressed as FO formulas. It appears that the automata notion they consider 
captures exactly 9Jl-recognizable relations for the special case OH = (S; (i-*a)aes)- 

We can generalize the previous results in the following way. 

Definition 3.11. Let = (S; . . . , R^) be a structure. We define the stucture 

= {^*; EqLength, ^, Ar^ , . . . , Ar^ , A=) 

-'^Note that Skolem arithmetic is tree-automatic in the sense of [2]. This comes from the fact that the 
structure DJl, i.e. Presburger arithmetic, is automatic. More generaUy one can prove that if 971 is an automatic 
structure and 9^ is lOT-automatic then 9^ is tree-automatic. 
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where 

• EqLength{x, y) holds if and only if x and y have the same length; 

• X <y holds if and only if x is a prefix of y; 

• for every i, A/j. (xi, . . . holds if and only if there exist words wi, . . . ,Wn G S* and 
symbols ai, . . . , a„ G S such that: 

- Xi = Witti for every i; 

- all Wj's have the same length; 

- (ai,...,a„) Gijf. 

• A={x, y) holds if and only if x and y have the same length and the same last letter. 

Theorem 3.12. Let dJt = (S;i?i, . . . ,Rk) be a structure, where |S| > 2. 

(1) For every n > 1 and every n—ary relation R over the relation R is ^-recognizable 
if and only if it is definable in S^; 

(2) FOiSm) reduces to FO{Tl). 

Proof. 

(1) It is easy to check that all base relations of S^^i are 9Jt-recognizable, which implies 
that Srjji is 9Jt-automatic, and thus by Theorem 13.91 every relation definable in is 
9Jt-r ecognizable . 

For the converse one can adapt again Biichi's technique of encoding runs of automata 
by monadic second-order variables. Assume that R C (S*)" is recognized by some 
automaton A = {Q,n,T,,Tl, E, I,T) whose set of states is Q = {qq, . . . ,qk}- We can 
define R in Sgjt by a formula ip{wi, . . . , Wn) which expresses the existence of a successful 
path of A, say {qjo^qji, ■ ■ ■ ,qj^), labelled by {vui, . . . ,Wn)- The formula encodes the 
path with k + 2 words zq, zi, . . . , zi^^y whose length is m-|- 1 (that is, \{wi, . . . , tUn)| + 1), 
and such that for every i, the word Zj. is the only word among zq, zi, . . . , whose i— th 
symbol equals the i—th symbol of y. That is, the variable y serves to identify Zj. for 
every position i (with the help of the predicate A=). 

(2) This is an immediate consequence of Theorem 13.91 together with the fact that S<xit is 
9Jl-automatic. 

□ 

In Section [5] we will improve item (2) of the above theorem by proving that even the 
monadic second-order chain logic of Sfxn reduces to FO{TV). 

4. A SPECIAL CASE OF THE FeFERMAN-VAUGHT COMPOSITION THEOREM 

In this section we put in evidence the strong relationship between 97t-automata and a 
special case of the Feferman-Vaught composition theorem. The Feferman-Vaught method 
presented in [14] generalizes Mostowski's work [23] about products of structures. Let us 
recall some useful notions and results from [231 [T3] • 

Definition 4.1. Let Tl = (S; Rf^, . . . , R^) be a structure, and let / be a non empty set. 
The direct power of a 9Jt with respect to / is defined as the structure 

m=i^';RT,...,Rf) 

such that 

• the domain of is the set of sequences / : / — > S 
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• for every j € {1, . . . , k}, if Rj is n— ary then for every n— tuple (/i, . . . , /„) of elements of 
S^, we have (/i, ...,/„) G i?J^ if and only if . . . , /n(«)) € iJ™ for every i G /. 

Example 4.2. Let 9Jt = (w; +), and I = lo. The direct power of SOT with respect to / is the 
structure with domain the set of sequences / : a; — > a; (in other words, the set of w— words 
over the alphabet w), and such that (/i, /2, /s) G +^ if and only if + /2(^) = fz{i) for 
every i ^ uj. 

Mostowski proves that the evaluation of FO formulas in the direct power 91 reduces 
to the evaluation of formulas in the factor structure OJt and formulas in the structure © = 
(5(1); C) (the index structure) where S{I) denotes the power set of /, and C is interpreted 
as the inclusion relation. Note that the FO theory of S is a variant of the MSO theory of 

Definition 4.3. Let R be an m— ary relation over elements of S^. A reduction sequence 
for R (with respect to the structures 9Jt, S) is a sequence ^ = {G, 6i, . . . ,6i) such that 

• G is a formula in the language of S; 

• 6i, . . . ,01 are formulas in the language of Tl; 

• for every m-tuple (/i, . . . , fm) of elements of S^, we have (/i, . . . , fm) G i? if and only if 

& ^ G{Ti, . . . ,Ti) 

where 

T, = {xel \m^eiih{x),...,fUx))} for every ie{l,...,l}. 
Example 14.21 (continued). 

• The base relation +^ of 91 admits a reduction sequence with respect to 971 = {uj; +) and 
& = {S{u!);C). Indeed we have (/i, /2, /s) G +^ if and only if the set of indexes i such 
that fi{i) + f2{i) = fsii) equals lo, that is if 

where 

T = {ieio\m^ fiii) + f2{i) = fsm. 

Thus +^ admits the reduction sequence ^ = {G,9i), where G{X) : VY Y C X and 
ei{x,y,z) ■.x + y = z. 

More generally all base relations of a direct power of a structure 971 admit a reduction 
sequence. 

• Consider the formula 

F{x) : yyiiy2{(3zi yi + zi = x A 3z2 y2 + Z2 = x) ^ 3u(yi + u = y2'V y2 + u = yi)) 

We have 9T |= F{f) if and only if / admits at most one non-null element /(«), that is, 
if the set of elements i such that f{i) + f{i) ^ f{i) contains at most one element. The 
unary relation defined by F admits the reduction sequence ^ = (G, ^i) where 

G{x) : VyVZ((y C X AZ C X) ^ {Y C ZV Z CY)) 

and 9i{x) : x + x ^ x. 

Theorem 4.4 (Mostowski [23]). Let dJt be a structure, I be a non empty set, and let 9t be 
the direct power of dJt with respect to /. Then 
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(1) one can compute effectively a reduction sequence for every relation which is FO definable 
in OT; 

(2) the FO theory o/DT reduces to the FO theories ofdJt and &. 

Mostowski also proves that F0{&) is decidable for every set / (by elimination of quan- 
tifiers), which together with point (2) in the above theorem implies that for every / the FO 
theory of the power of 9Jt with respect to / reduces to the FO theory of 

Example 14.21 (continued). The FO theory of is decidable, since it reduces to the FO 
theory of (w; +) which is decidable [25] . 

Another important notion is the one of weak direct power of a structure. In this variant 
we consider a structure 971 = (S; Rf^, . . . , R^, P^) with some distinguished element e € S, 
and where Pe{x) holds in DJl if and only if a; = e. The weak power of SDT with respect to / 
is defined in the same way as in Definition 14.11 but here the domain of 0^ is the set SD^i^'' of 
sequences f : I ^ T, such that f{i) ^ e for finitely many values of i. Mostowski proves that 
Theorem 14.41 still holds for weak direct powers, with the following modifications: 

• for the index structure 6 one considers the structure 6/m = (5'+(/);C) where S~^{I) 
denotes the set of finite subsets of /. 

• one considers only reduction sequences = (G, 9i, . . . ,9i) such that Tl \= -^6i{e, . . . , e) 
for every i (this condition ensures that all sets Tj in Definition 14.31 are finite). 

Note that &fin is a FO variant of the weak MSO theory of /. In [23] it is shown that 
FO{6fin) is decidable for every set /. Therefore FO(DT) reduces to FO(JM). 

Example 4.5. (Decidability of Skolem arithmetic [23j) We revisit here Example 13.81 Con- 
sider the structure 971 = {uj;+,Po) where is the distinguished element, and I = lo. Then 
the weak direct power of 971 with respect to / is the structure 91 = {luI^^ ; + , Pq) whose 
domain is the set of sequences f : lo ^ uj such that f{i) ^ for finitely many values of i, -|- 
denotes the graph of addition of sequences f : lo ^ uj, and Po{f) holds only for / = 0. It 
follows from Mostowski's result that -FO(9T) is decidable since it reduces to FO(97t) which 
is decidable [25]. Now observe that the application h : lo^^^ — > \ {0} which maps every 

sequence / G lo^^ to the integer h{f) = 2^^^^3-l'''^^ . . . , defines an isomorphism between 91 
and the structure {uj \ {0}; x,Pi) where x denotes the graph of multiplication, and Pi{x) 
holds if and only if x = 1. Therefore the FO theory of the latter structure is decidable. 

Feferman and Vaught [14] generalize Mostowski's technique by allowing index structures 
of the form 6 = {S{I); C, 5i, . . . , Sm) where the Si's denote any relations. 

Definition 4.6. Let 97t = (S; . . . , R^) be a structure, / be a set, and let S = (5(/); C 
, Si, . . . , Sm) where the Si's denote relations. We call generalized powe^ ofTl with respect 
to S every structure of the form 

m={^';Pl,P2,...,Pn) 

such that all relations Pi admit a reduction sequence with respect to 971 and S. 

Theorem 4.7 (Feferman- Vaught [14J). If ^ is a generalized power o/97I with respect to 6, 
then Theorem \4.4\ holds for 97t, 91 and S . 

Our definition is a slight modification of tlie original definition. Indeed Feferman and Vaught define 
the generalized power of 9Jl with respect to 6 as the structure with domain and with infinitely many 
relations Pi, one for each relation which admits a reduction sequence. 
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Example 4.8. Let I = uj, and let & be the structure &uj = {S{uj); ^,^) where x <C y if 
and only if x and y are two singleton sets, say x = {m} and y = {n}, such that m < n. This 
structure is a FO version of the MSO theory of {uj; <), which was shown to be decidable 
by Biichi [8]. Let Tl = {lu;+). Consider the structure = where Pi{f) holds in 

if and only if there exist j (z uj such that f{i) = for every i > j. The structure 97 is a 
generalized power of 9Jl with respect to &. Indeed it is easy to check that Pi{f) holds in 97 
if and only if 

6 ^ 3XiVX2(Xi < ^2 ^ ^2 C T) 

where 

T = {ieio\Tl^ fii^ + f{i) = f{i)}. 
By Theorem 14. 7^ the FO theory of 91 is decidable, since the FO theories of dXt and S^^ are 
decidable by [25l|8]. 

Feferman and Vaught also define the notion of generalized weak power of a structure 
OK = (S; , . . . , iif, P^) with respect to some index structure & . This notion generalizes 
the one of weak power by allowing to deal with index structures of the form & = {S^{I); C 
, Si, . . . , Sn) where the S'j's denote relations over S~^{I). Feferman and Vaught prove that 
Theorem 14.41 still holds for generalized weak powers, with the same modifications as for 
direct weak powers. 

Let us consider the case where © is the structure S<a; = (S'"*'(w); C, ^). The FO 
theory of S<^ is a variant of the weak MSO theory of (w; <). In this case there is a close 
correspondence between relations which admit a reduction sequence with respect to &, and 
MS'O (9Jt)-definable relations, or equivalently SP7-recognizable relations (by Theorem I3.4p . 

Consider indeed an alphabet S, and the application fi which maps every finite word 
w over S to the a;— word n{w) = w^'^ over (S U {#})'^. The word fi{w) can be seen 
as an element of (S U {#})^'. Given an n— ary relation R over S*, we set ju(i?) = 
{{fi{wi), . . . , fi{wn))\{wi, . . . ,Wn) € R} ■ The relation fi{R) can be seen as a subset of the 
set of sequences / : w — > w U {#} such that /(i) ^ # for finitely many values of i, i.e. as a 

subset of (SU{#})|'\ 

Proposition 4.9. Let 971 = (E; . . . ) be a structure. For every n > 1 and every n—ary 
relation R over T,* , the relation R is M SO {^) -definable if and only if the relation fi{R) 
admits a reduction sequence with respect to 971^ and S<a;. 

We shall illustrate this proposition by some example, and leave the proof to the reader. 

Example 13.21 revisited. Let 9Jt = {uj; +), and let R denote the set of words w over uj that 
contain only even symbols. The relation R is MSO (9Jt)-definable by the sentence Vy aF{y), 
where F{x) : 3z{z + z = x). 

In this case ^j.{R) corresponds to the set of sequences / G (cj U {#})^^ such that 

• the set of indexes i such that f{i) 7^ # is an initial segment of oj; 

• for every i such that f{i) 7^ f{i) is even. 

This implies that ii{R) admits a reduction sequence with respect to 971:^ and 6<u;- Indeed 
we have / G ^(^) if and only if 

e<^ ^ vxvy((y c r a x < y) ^ x c r) a vy (y c r ^ y c r') 

where 

T = {ieuj\Tl#^ f{i) / #} 
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and 

r = {ieuj\m#^ 3z{z + z = f{i))} 

The second part of the above formula is a direct translation of the MSO sentence Vy apiy) 
used to prove the MSO (9Jl)-definability of R. It is not difficult to prove that a similar 
translation is possible for every MSO (9JI)-definable relation. 

Recall that by Proposition 13.41 AfS'0(9Jt)-definability and SDT-recognizability are equiv- 
alent. Proposition 14.91 allows to revisit our previous results in terms of powers of structures: 

• The closure of 9Jt-recognizable relations under boolean operations, projection and cylin- 
drification (Proposition I2.6( ) . could have been proven as a consequence of Proposition 14.91 
together with the closure under definability of relations which admit a reduction sequence. 

• If = {N;Ri, . . . ,Rn) is an 97t-automatic structure then by Proposition 13.41 and 14.91 the 
image by /i of the domain and base relations of admit a reduction sequence with respect 
to and S<t^. Thus every automatic structure is isomorphic to some relativized 
generalized weak power of with respect to S<a;. Moreover by Feferman-Vaught 
Theorem, FO(DT) reduces to FO{m#) and F0(6<^). Now one can reduce FO(Tl#) to 
FO(Tl), and FO(S<a;) is decidable by [7]. Finally this proves that F0(9^) reduces to 
FO{M), that is, Theorem EH 

• In the same way as 9Jt-automata over finite words correspond to generalized weak powers 
with respect to the index structure 6<w, one can define a notion of 9Jt-automata over 
w— words and show that it corresponds to generalized powers with respect to the index 
structure 6^;. 

5. Applications 

5.1. An application to monadic chain logic over iteration. We apply the previous 
results to improve a recent result of Kuske and Lohrey [19] . This application was brought 
to our attention by Wolfgang Thomas. 

In |19j the authors consider decidability issues related to monadic second-order chain 
logic, and applications to pushdown systems. Given a structure 21 = {A; <,...) where 
< denotes a partial ordering, the monadic second-order chain logic of 21, which will be 
denoted by MS'O'^'^ (21), is the fragment of the MSO theory of 21 where monadic second 
order quantifications are restricted to chains (i.e. linearly ordered subsets) with respect to 
<. The logic MSO'^^ was first investigated in [30] . 

Consider a ^-structure dJl = (S; Ri, . . . , Rm)- The basic iteration ofdJt is the structure 

miL = (S*;^,ifi„...,ifi^) 
where, for every relational symbol Rj with arity n, 

^Rj = {{uai, . . .,uan) | u G S*, (oi, . . . ,a„) G Rf^} 
Kuske and Lohrey prove the following result. 

Theorem 5.1. [19^ Theorem 4.10] For every structure dJl, the MSO'^^ theory ofdJll^ reduces 
to the FO theory of Tl. 

We can improve this result by replacing the structure SUl^^ with the structure which 
we introduced in Section [3?2l 
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Theorem 5.2. For every structure Tl = (S; . . . , the MSO theory of Sm = 

{T,*; EqLength, Ajij^, . . . , Aji^, A=) reduces to the FO theory ofDJt. 

Observe that all predicates Ar. can be defined in S<xn- 

Proof. The proof consists in two main steps. The first one is to reduce the MSO'^^ theory of 
iSgjt to the FO theory of some structure S'^ with domain S"^. The technique is an adaptation 
from |31j, Section 4]. The second step consists in proving that 5^ is a generalized power 
with respect to 9Jt and G^, which allows then to use Feferman-Vaught Theorem and the 
decidability of FO{&i^) to conclude. 

We first consider a variant of MSO'^^ of where only second-order variables occur; 
this can be done by introducing the inclusion predicate Xi C X2, and replacing relations 
between elements by the corresponding relations between singleton sets. 

Now every chain X of elements of S* can be represented by a couple (u, v) of elements 
of S*^ in the following way: 

• u corresponds to the "direction" of the chain X, i.e. is such that all elements of X are 
prefixes of u (note that if X is infinite then there exists a unique such u) ; 

• V indicates which prefixes of u belong to X, in the following way: for every integer i, we 
have v[i] = u[i] if and only if the prefix of u of length i belongs to X. 

According to this definition, any couple (n, v) of elements of S'^ represents a single chain 
which will be denoted by ch{u,v). 

The previous encoding allows to reduce the MSO^^ theory of S^m to the FO theory of 
the structure: 

S^jji={^'^;^,C', EqLength', ^',A'^^,..., A^J 

where 

• Rr: (til, vi, U2, V2) holds if and only if ch{ui, vi) = ch{u2, V2); 

• ^' {ui,vi,U2,V2) holds if and only if ch{ui,vi) C ch{u2,V2); 

• EqLength' {ui,vi,U2,V2) holds if and only if there exist two words u,u' G S* such that 
ch{ui,vi) = {u}, ch{u2,V2) = {u'}, and 5srrt |= EqLength{u,u') ; 

• ^' (^^ii ^^ii ^2, V2) holds if and only if there exist two words u, u' G S* such that ch{ui, vi) = 
{u}, ch{u2,V2) = {u'}, and \= u ^u'; 

• For every i, if Ri is a n— ary relation, then A'j^^{ui,vi,U2,V2, ■ ■ ■ ,Un,Vn) holds if and 
only if there exist words wi, . . . ,Wn G such that ch{uj,Vj) = {wj} for every j, and 

\= AR^i'Wl, . . .,Wn). 

• AL{ui,vi,U2,V2) holds if and only if there exist two words u, v! G S* such that ch{ui,vi) = 
{u}, ch{u2,V2) = {u'}, and Sm \= A={u,u'); 

We shall prove that 5^ is a generalized power of Tl with respect to S^;. To this aim, 
let us prove that all base relations of 5^ admit a reduction sequence with respect to 9Jt 
and So;: 

• ^' (^1; ^^1; ^2, ^2) holds if and only if 

h Ti C r2 A 3X{IS{X, Tg) A Ti C X) 

where 

Ti = {i euj \ ui{i) = vi{i)}, 
T2 = {i G [ U2{i) = V2{i)}, 
Ts = {i G w I ui{i) = U2{i)}, 
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and IS{X,T^) is a formula which expresses that X is the greatest initial segment of u 
which is contained in T3 (this is expressible in &uj)', 

• ^ {ui,vi,U2,V2) holds if and only if both C' (tti, vi, «2, ^2) and C' (^2, ^2, ui, f 1) hold, 
from which we can deduce a reduction sequence for the relation «; 

• EqLength'{ui,vi,U2,V2) holds if and only if Ti and T2 are singleton sets and Ti = T2, 
i.e. if 

&^^3Y{Ti «y) ATi =r2 
(with the same notations as above); 

• ^' {ui,vi,U2,V2) holds if and only if 

e,^ ^ 3y(Ti < y) A (Ti = r2 v Ti < r2) a 3X(/5(x,T3) a Ti c x); 

• for every i, if i?j is a n— ary relation, then A'j^\ui,vi,U2,V2, ■ ■ ■ ,Un,Vn) holds if and only 
if all sets Uj = {i & uj \ Uj{i) = Vj{i)}, j = 1,2, ... ,m, are singleton sets and are equal, 
and are included in the set 

U = {i £uj \Tl\= Rj{ui{i),U2{i), . . . ,«„(?))}. 

These properties can be expressed in S^. 

• the case of AL{ui,vi,U2,V2) is similar to the previous case, with n = 2 and = in place of 
Rf, 

We have proved that S!^ is a generalized power of 9H with respect to S^^. By Theorem 
WTh FO{S'^) reduces to the FO theories of M and S^. Now FO(e^) is decidable by Biichi 
[8], thus FO{Sijj^) reduces to F0(9Jl). □ 



5.2. Ordinal addition and (uj; +)-recognizability. We shall focus now on the case 971 = 
{uj;+), where + denotes the graph of addition. In this case we present another logical 
characterization of 9Jt-recognizable relations in terms of ordinal theories. This is essentially 
a reformulation of known results. 

In the sequel we consider structures of the form (a; +) where a is an ordinal. The 
domain is the set of ordinals less than a, and + is interpreted as the graph of ordinal 
addition restricted to the domain. 

Feferman and Vaught prove in [TJ] that for every ordinal 7 the structure {uj'h +) is 
isomorphic to some generalized weak power of {uj; +) with respect to {S~^{'j); C, ^j^. 

In particular for 7 = 0; their result, combined with Biichi's result, implies that via some 
encoding all relations definable in are (w; +)-recognizable, and that the theory of 

(w'^; +) is decidable. 

Let us be more specific. We first recall some useful results on ordinal arithmetic; all of 
them can be found e.g. in Sierpinski's book [291 chap. XIV] 

Proposition 5.3 (Cantor normal form for ordinals). Every ordinal a > can be written 
uniquely as 

a = uj^^ai + • • • + uj^'^ak 
where qi, 02, . . . , Ofc is a decreasing sequence of ordinals, and < Oi < uj. 

"^As a corollary, the FO theory of {uj'';+) reduces to the FO theory of (Presburger Arithmetic, 

which is decidable [25]) and the weak MSO theory of (7, <). The latter was proved to be decidable by Biichi 
[S] a few years after Feferman- Vaught' work, which implies the decidability of the FO theory of (tj^; +). 
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The following proposition relates the Cantor normal form of the ordinal q + /3 to the 
one of a and /?. 

Proposition 5.4. Let a = w^^ai + • • • + w"'-'afc and /? = uj^^h\ + • • • + a;'^'6/ he two ordinals 
> in Cantor normal form. 

• If ai < (3i then a + (3 = fi 

• If Oil > /?! and if aj = (3i for some j, then 

a + p= {co^'ai + ■■■ + co^^-'aj^i) + co'^^ {aj + 6i) + (cu^^^^ + . . . + u;Hi) 

• If Oil > Pi and if aj ^ (5i for every j, then 

a + 13= {u'^^ai + ■■■+ u""^am) + (uj'^'h + ■■■+ uHi) 
where m is the greatest index for which Um > Pi ■ 

Consider now the function / : w'^ — > a;* which maps every ordinal a < lv^ , written 
in Cantor normal form as a = X]i=m'^*'^* with ai < lo and am / 0, to the word c{a) = 
aQ . . . am over the alphabet to. Given n ordinals ai,...,a„, we define c(ai,...,a„) as 
(c(ai), . . . , c(a„)), where we choose as the padding symbol 

Example 5.5. Consider the ordinals 

a = • 5 + cj"^ • 4 + • 3 + • 2 + • 11, 

and 

/? = ■ 17 + • 6 + • 2. 
Then by Proposition 15.41 (second case), the ordinal 7 = a + /? equals 

J = (uj^ -5 + 00^ ■ 4) +u;^ ■ {3 + 17) + {0;"^ -e + uj^ ■ 2). 

We have 



c(a,/5,7) = 




Proposition 15.41 is the key argument in Feferman-Vaught' proof that {uj"'; +) is iso- 
morphic to some generalized weak power of with respect to (5'"'"(7); C, ^). Let us 
reformulate their ideas in terms of (uj; +)-automata. 

Proposition 5.6. The image by c of the graph of addition for ordinals < lo^ is {uj;+)- 
recognizable. 

Proof. A convenient (a;; +)-automaton which recognizes the language 

X = {c(q, /3, 7) I Q, /3, 7 < w'^, a + P = 7} 
is pictured in Figure [21 where 

• ^iix,y,z) : z = y 

• ip2{x, y,z) : y=^OAz = x + y 

• V3{x,y,z) : y = OAz = x 
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Figure 2: An (w; +)-automaton for ordinal addition 

This automaton has two states qo,Qi- Both are initial states, and only qi is final. Using 
Qi as the initial state allows to deal with the case /3 = 0; in this case we have 7 = 0, which 
is checked by the transition labelled by (^3. 

Using qq as an initial state allows to deal with the case /3 7^ 0. In this case let denote 
the greatest power of co which appears in the Cantor normal form of /3. The transition 
labelled by ipi allows to deal with coefficients of powers uj^ where i < (3i; for these powers 
the corresponding coefficients of /? and a + /3 must be equal. The transition labelled by 922 
corresponds to the power uj^^ . Then for all powers co^ such that j > /3i, the corresponding 
coefficients of a and a + f3 coincide; this corresponds to the transition labelled by 993. □ 

We can provide now a characterization of 97t-recognizable relations for the case Tl = 

Proposition 5.7. For every n > 1, and every n—ary relation R over uj^ , the relation R is 
definable in (a;'^; +) if and only if c{R) is -recognizable. 

Proof, (sketch) The "only if part comes from the fact that the range of c, as well as the 
graph of ordinal addition, are (w; +)-recognizable. Thus (a;'^;+) is (w; +)-automatic, and 
the result follows from Theorem 13.91 

For the converse one can use again Biichi's encoding technique as in Theorem 13.121 
Assume that c{R) is (w; +)-recognizable by some (w; +)-automaton A whose set of states is 
Q = {^O; ^Zi) • • • ) ?m}- We can define R in {oj^; +) by a formula (p[ai , ■ ■ ■ , which expresses 
the existence of a successful path of A, say {qjg,qj^, . . . , qj^), labelled by c(ai, . . . , a„). The 
formula encodes the path with an ordinal of the form 7 = w^jVn + ^^~^jm-i + • • • + i^^jo- 

We need to define the following auxiliary predicates (we explain briefly how to define 
them in (w'^; +)): 

• a < P (we have a < /3 if and only if there exists some non-null ordinal 7 such that 

/3 = a + 7); 

• the function (xi, . . . 1— > max(a;i, . . . ,Xn); 

• "to be a limit ordinal less than u;'^" (these are non-null ordinals which have no predecessor 
with respect to <); 

• Pow{x) which holds iff x is a power of uj less than uo"^ (which holds iff x is a limit ordinal 
and there do not exist limit ordinals /9,7 such that x = (5 + ^ and 7 < 

• For every i < to, the relation Multi{x) which holds iff x an ordinal of the form uj^ ■ i 
(easily definable with the predicate Pow{x)); 

• the function x 1— > xoo (for x 7^ 0, the ordinal xoo is the least power of w greater than x); 

• App{x, y) which holds iff y is a power of uj which appears in the Cantor normal form of x 
(this holds if and only if Pow{y) holds and moreover there exist ordinals /32 such that 
x = Pi+y + (32 and (32 < y); 
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• AddCoef{x,y,z) which holds if and only if there exist i,j,k < uj such that x = uj^i, 
y = uj^j and z = uj^{i + j) - which is equivalent to saying that z = x + y and there exists 
exactly one ordinal a such that App{x, a) A App{y, a) A App{z, a) holds. 

• Term{x,y, z) which holds iff z is a power of a;, say z = oj^, y = oo^i for some i < oj, 
and y is the term which corresponds to a;^ in the Cantor normal form of x. The relation 
Term{x, y, z) holds if and only if Pow{z) holds, z is the only power of w which appears 
in the Cantor normal form of y, and there exist ordinals /3i, /32 such that x = (3i + y + (32 
with [32 < y, and z do not appear in the Cantor normal forms of (3i and /?2- 

• For every formula ^p{xi, . . . , Xn) in the language {+,=} one can define the predicate 
Sjjjiyi, • • • , yn, z) which holds if and only if z is a power of oj, say z = uj^ , and if we denote 
by ai, . . . , On the coefficients of uj^ in the Cantor normal forms of yi, . . . , y„, respectively, 
then (w; +) |= ■(/'(oij • • • ) On)- The predicates can be defined from Term and AddCoef 
by induction on the construction of ip. 

• Codi{x,y) expresses that y is a power of lo and the coefficient of y in the Cantor normal 
form of x equals i. This predicate is easily definable from the predicates Multi and Term. 

Finally we can define the formula (/j(qi, . . . , a„) as 

37 ^7 < max(ai, . . . , a„) ■ u; (5-1) 
A \l Codi(ri,uj'') (5.2) 

A{y(3{{Pow{l3) AP < max(Qi,...,a„) -w) — > (5.3) 
y (Codi(7,/5) A5^(ai,...,a„,/3) AC7odj(7,/3cj))) (5.4) 

A \f Codj(7,max(ai,...,a„) -tj) j (5.5) 

Line 15.21 states that the first state of the sequence of states encoded by 7 is an initial state; 
lines 15.3] and 15.41 that consecutive states in the sequence use transitions of the automaton, 
and line 15.51 that the last state of the sequence is terminal. □ 

Remarks 5.8. 

• One can prove that the graph of 2; 1-^ ujx is not 9Jt-recognizable, either in a direct way, or 
using the fact that by [11] the theory of {oJ^\ +, x 1— > wx) is undecidable, while the theory 
of (cj'^; +, x 1-^ xijj) is decidable since the function x ^ xuj is definable in {uj^; +) which 
has a decidable theory. 

• We could reformulate the above results by replacing (uj^; +) by the structure {iv; x, <p), 
where x <p y holds if and only if x < y and x, y are prime numbers. In this case we encode 
every word n = . . . over the alphabet uj by the integer c'{u) = 2"o+i3'^i+i . . 
where pn denotes the n— th prime number. We refer to [22| for details about the link 
between (a;'^;+) and {lo; x,<p). 
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6. An EXTENSION OF THE FeFERMAN-VAUGHT FORMALISM 

The automata and logic that we introduced in the previous sections do not allow com- 
parisons between symbols from different positions. For instance, for every structure 971 
whose domain is infinite, the language {ss \ s € |M|} is not MSO (9Jt)-definable (see Ex- 
ample ETS]). More generally, given any formula ip{x,y) in the language of 9Jt, the language 
{siS2 I \= if{si,S2)} is not in general AfS'0(97T)-definable. 

A natural way to add expressive power is to extend MSO with predicates such as 
P{x, y) interpreted as "x, y are two positions in w such that w[x\ = w[y\\ or more generally 
predicates interpreted as "x, y are two positions in w such that 9Jt# \= (p{w{x\,w[y]y'' (where 
if is some £#-formula). 

However these extensions do not add expressive power when 971 is finite, and lead to 
undecidable theories in case 971 has an infinite domain (we refer the reader e.g. to [3] where 
it is shown that much weaker related formalisms have undecidable FO theories). 

Thus in order to get decidability results we have to restrict the use of these new pred- 
icates. Below we describe a syntactic fragment for which the satisfiability problem still 
reduces to the decidability of the first-order theory of 97t. 

Given an ^-structure 971 = (S; . . . ), we associate to every formula F with m free 
variables some (new) m— ary relational symbol Op- 

Definition 6.1. We define MS'0+(£) as MSO over the language {<,{eF)F(Lr} where F 
denotes the set of formulas with at least one free variable. 

The interpretation of M50+(£) sentences is similar to MS'0(£), but for every 11^- 
formula F with m free variables the interpretation of 9f{xi, . . . ,Xm) is "the positions 
xi, . . . , Xm in the word w satisfy 97t^ |= F{w[xi\, . . . , i(;[xm])". 

Definition 6.2. We say that X C S* is MS'0+ (97r)-definable if there exists an M50+(£)- 
sentence (p which defines X. The definition can be extended easily to the case of subsets 
X C (S*)". 

Note that if one allows only M SO~^ {2^) sentences where the predicates Op are unary, 
we get nothing but MSO{£). 

Example 6.3. Let 97t = (w; +). 

• The language X C. u)* oi words u over oj such that some symbol s € S appears at least 
twice in u is MS'0+ (97t)-definable by the MS'0+ (£)-sentence 

3x3?/(x < y A 9F{x,y)) 

where 

F{xi,X2) : xi = X2- 

• The language X' C u* of words over lv of the form u = sq . . . Sm such that there exists 
j € {0, ...,m} such that Sk > 2sj whenever k > j, is M^O"*" (97t)-definable by the 
MS'0+ (£)-sentence 

3x{3x'{x < x ) A V?/(x < y ^ &g{x, y)) 

where 

G(xi, X2) : 3z(x2 = xi + xi + z) 
denotes the formula which expresses that X2 > 2xi. 
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Example 6.4. Let 9Jt = (Y,*;EqLength,^,{La}a£T,) denote the EES structure S (see 
Section I3.2p . The set of words w = sq . . . Sm over the infinite alphabet F = S* such that 
all even positions carry the same symbol, and all odd positions carry a symbol which is a 
prefix of sq, is (9Jt)-definable. Indeed a convenient MS'O"^ (£)-sentence is 

3X [EvenPositions {X)A 

A3x £ X\/y £ X 0Fi{x,y) A 3z(Vt < z A^y ^ X ep^iy^z))] 

where EvenPositions{X) is an MSO-formula which expresses that X consists in the set 
of even positions of and 

Fi{vi,V2) : vi=V2; 

F2{vi,V2) ■■ Vi ^ V2. 

The formalism MSO^{£) is in general too expressive with respect to decidability, thus 
we have to consider a syntactic fragment of it. 

Definition 6.5. We define MSO^{£,) as the syntactic fragment of MSO^{£,) consisting 
in formulas of the form 

3X1 .. . 3x„ (p{xi, ...,Xn) 
where (p is an M^O"^ (£)-formula which satisfies the following constraint, which we denote 
by (*): all predicates of the form 9p in ip have the form 6f{xi, . . . , x„, y), i.e. contain at 
most one free variable distinct from the x[s. 

Note that formulas considered in Examples 16.31 and 16.41 are MSO]^ (£)-formulas. 

Theorem 6.6. The emptiness problem for MS 0^{^)- definable languages reduces to the 
decidability of the FO theory of 9Jt. 

Proof. Let S be the domain of 971. To each MS'O J (£)-sentence ij: of the form 

3X1 .. . 3x„ if{xi, ...,Xn) 

where (/? satisfies (*) we associate in an effective way an MSO (£')-formula ip' where £' 
is obtained by adding to £ new constant symbols ci,...,Cn, in order that for every £- 
structure 9Jl, the sentence ■0 is satisfiable by some word over S if and only if there exists 
some expansion 971' of 971 such that the the set of words over S defined by ip' is not 
empty. 

The transformation proceeds as follows. First, we can assume that all formulas of the 
form 9f{xi, . . . , x„, y) which appear in ip are such that y appears freely in 6p: indeed if y 
does not appear in Bp then 6f{xi, . . . , Xn) is equivalent to 3y(y = x„ A 9fi{xi, . . . , x„_i, y)) 
where F' is obtained from F by substituting y for x^. 

We define the M50 (£')-formula V' as 

3x1 .. . 3x„ {/\ ap.ixi) A ip'{xi, . . . ,x„)) 

i=l 

where Fi{y) denotes the formula y = Ci and ip' is obtained from p by replacing every 
formula 9f{xi, . . . ,Xn,y) by the formula api^y) where F' is obtained from F by replacing 
every occurence of Xj by the constant symbol Cj. 

It is easy to check that for every ^-structure 971, '0 is satisfiable by some word model 
over S if and only if there exists some expansion 971' of 97T such that L^tp') ^ 0. 
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The formula ip' involves the predicates api, ■ ■ ■ ,otF„, and also predicates of the form a f 
which appear in if' , say , • • • , ctFp • By Proposition l3.4l given 971' the question of whether 
the language defined by V'' is empty reduces to decide emptiness for the corresponding 
SUt'— automaton. This amounts to compute the set i^gjt' of subsets I C {1, . . . ,p} such that 
there exists a E S such that |= Fi{a) if and only if z G /). Thus it suffices to compute 
all possible sets E^i for all expansions OJt' of 9Jl. This can be done effectively since for 
every subset E of subsets of {1, ... one can find an ^-sentence He such that 9Jt |= He 
if and only if there exists some expansion 971' of 97T such that i^rrjt' = Therefore we 
reduced our initial problem to the question of whether 97t satisfies some sentence. □ 

7. Discussion and conclusion 

The proof of Theorem 16.61 makes uses of Biichi's decidability result for the WMSO 
theory of (w; <). However the arguments are sufficiently general to apply to any decidable 
extension of WMSO. An interesting example is the WMSO theory Tcard of without 
<, but with the predicate X ^ Y interpreted as "X and Y have the same cardinality". 
This theory was proven to be decidable by Feferman and Vaught in [14j by reduction to 
Presburger Arithmetic (by elimination of quantifiers, and without using the composition 
technique). For recent applications of this decidability result we refer the reader to the 
papers [iHl ESl ED] . 

One can show that Theorem 16.61 holds with T^ard^ which provides a class of the- 
ories which are both decidable and quite expressive. As an example, if we set 97t = 
{T,*; EqLength, ^, {La}aeT;) (the EES structure, whose FO theory is decidable [l3]), then 
the corresponding syntactic fragment allows to express properties related to finite words 
w over the alphabet S' = S* (that is, finite sequences of words over S) such as "there 
exist two distinct symbols s, s' appearing in w such that at least one third of the symbols 
in w are prefix of s, or have the same length as s'". Another interesting example is the 
case 971 = {to; +). In this case we obtain a decidable fragment for words over the alphabet 
w, i.e. lists of natural numbers. This fragment might be an interesting formalism for the 
verification of programs which manipulate pointers and linked data structures. 

By Proposition 13.41 971-automata capture the logic MS'0(£). Thus a natural issue is to 
get an automata counterpart for the logic MSO~^{£). An idea is to consider 97t-automata 
equipped with a finite number of "write once" registers. In addition to the usual transitions 
of 971-automata, these automata are allowed to write the current symbol in some empty 
register, and test whether the symbols currently stored in the registers and the current 
symbol satisfy some sentence in 97t#. Once a symbol is stored in some register, the 
automaton cannot store any other symbol in this register. In order to capture the fragment 
MSO^{2), it seems that one should also allow non-deterministic e— transitions where the 
automaton chooses to store some symbol from the input alphabet in some (empty) register. 

Another interesting issue would be to find (more natural) extensions of the Feferman- 
Vaught formalism in the spirit of Theorem 16. 6[ The formalism MSO~^{£) allows the use 
of predicates 9f for all formulas F, which makes necessary to consider the fragment 
MSO^{2) in order to get decidability results. It would be interesting to find other fragments 
of M SO~^ {2) obtained by imposing conditions on the formulas F. One can consider 
e.g the case where we allow only the use of formulas F which define equivalence relations 
in 97T. Note that similar results are already proven in the papers Ol^. 



22 



ALEXIS BES 



Finally, it seems that all results in this paper can be extended rather easily to the case 
of infinite words as well as (in)finite binary trees, by relying on classical decidability results 
for MSO theories. 
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